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ABSTRACT 

It is a central issue to elucidate the new type of molec- 
ular recognition accompanied by a global structural 
change of a molecule upon binding to its targets. 
Here we investigate the driving force for the binding 
of R12 (a ribonucleic acid aptamer) and P16 (a partial 
peptide of a prion protein) during which P1 6 exhibits 
the global structural change. We calculate changes in 
thermodynamic quantities upon the R1 2-P1 6 binding 
using a statistical-mechanical approach combined 
with molecular models for water which is currently 
best suited to studies on hydration of biomolecules. 
The binding is driven by a water-entropy gain origi- 
nating primarily from an increase in the total volume 
available to the translational displacement of water 
molecules in the system. The energy decrease due 
to the gain of R12-P16 attractive (van der Waals and 
electrostatic) interactions is almost canceled out by 
the energy increase related to the loss of R12-water 
and P16-water attractive interactions. We can ex- 
plain the general experimental result that stacking 
of flat moieties, hydrogen bonding and molecular- 
shape and electrostatic complementarities are fre- 
quently observed in the complexes. It is argued that 
the water-entropy gain is largely influenced by the 
geometric characteristics (overall shapes, sizes and 
detailed polyatomic structures) of the biomolecules. 

INTRODUCTION 

An intrinsically disordered protein is characterized by a 
lack of stable tertiary structure when it is present as an 
isolated polypeptide chain in aqueous solution under the 
physiological condition. Its folding is coupled with binding 
to its targets in the sense that it constructs a well-defined 
structure only after the binding is implemented (1). An ap- 
tamer, which is a folded single-stranded nucleic acid, pos- 



sesses structural plasticity and binds to its targets by chang- 
ing its structure in accordance with the target structures 

(2) . The structure of the aptamer remains unchanged in 
some cases, and disordered, flexible portions of its targets 
become structured upon binding. This type of molecular 
recognition accompanied by a global structural change of 
a molecule upon binding to its targets appears to be sub- 
stantially different from that described by the 'lock and key' 

(3) or 'induced-fit' (4) binding model. Elucidation of these 
molecular-recognition mechanisms is a central issue in bio- 
physics, biochemistry and structural biology. In our opin- 
ion, all types of molecular recognitions share the mecha- 
nism based on the same physicochemical origin and can be 
explained within the same theoretical framework in a uni- 
fied manner. The present study provides a remarkable first 
step toward achieving such elucidation by considering a ri- 
bonucleic acid (RNA) aptamer (5) as an important example. 

Aptamers are RNAs and deoxyribonucleic acids origi- 
nating from in vitro selection experiments which optimize 
the nucleic acids for high-affinity binding to their targets 
by starting from random sequence libraries (2,6,7). Up to 
now, many aptamers have been developed against a wide 
variety of targets such as small molecules (8), small inter- 
fering RNA (siRNAs) (9), peptides (10), sugars (11,12), 
proteins (13-17) and cells (18,19). Aptamers have many 
potential uses as effective therapeutics (20), drug delivery 
agents (21), purification agents for the therapeutic anti- 
bodies (16) and molecular probes (22). Elucidation of the 
molecular-recognition mechanism common in all of the ap- 
tamers presents much challenge not only from the scien- 
tific viewpoint but also for medical and technological ap- 
plications. The RNA aptamer considered in the present 
study, r(GGAGGAGGAGGA) (R12), binds to and stabi- 
lizes a normal cellular form of a prion protein (PrP c ) (23), 
which is expected to prevent prion diseases (5). R12 takes 
a unique quadruplex structure forming a dimer (23). In 
this case, two portions (PI and PI 6) of the N- terminal half 
of PrP c are disordered and flexible, and they form well- 
defined structures in accordance with R12 upon binding. 
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Figure 1. Binding of two PI 6s to an R12 dimer. The complex formed is 
denoted by '2xR12:2xP16'. The water molecules are omitted here. 



Mashima et al. (5) performed experiments for binding of 
PI 6 (Gly-Gln-Trp-Asn-Lys-Pro-Ser-Lys-Pro-Lys-Thr-Asn) 
to R12, and we consider this fundamental binding process. 
In this process, two PI 6s bind to the R12 molecules form- 
ing the dimer, respectively, as illustrated in Figure 1 . Since 
the upper and lower halves of the 2xR12:2xP16 complex 
share essentially the same structure, the theoretical investi- 
gation can be made for the constituent process in which a 
PI 6 binds to one of the R12 molecules, forming an R12:P16 
complex. 

The clue to the molecular-recognition mechanism is the 
driving force for the binding. The force has been investi- 
gated primarily from the viewpoint of structural biology 
on the basis of the three-dimensional (3D) structure of the 
aptamer-target complex. A prevailing view is that an ap- 
tamer binds to its targets via aptamer-target electrostatic at- 
tractive interactions due to the contact of groups with pos- 
itive and negative charges, that is, the electrostatic comple- 
mentarity. Actually, in several aptamer-target complexes, 
negatively charged phosphate backbones in the aptamer are 
in contact with positively charged moieties of the targets. 
For example, Toggle-25t (an RNA aptamer) binds to the 
positively charged arginine-rich surface of thrombin (15). 
In the R12:P16 complex, the contact of phosphate groups 
of R12 with lysine residues of PI 6 is observed as illustrated 
in Figure 2 (5) (Figures 2, 4 and 5 are drawn using the 
Visual Molecular Dynamics (VMD)(24)). However, there 
is an experimental result that raises a doubt with respect 
to this view: in the complex of Apt8 (an RNA aptamer) 
binding to the Fc fragment of Human IgGl (hFcl), the 
aptamer-bound area of hFcl consists of a less-positively 
charged surface (16,25). In the case of the R12-P 16 binding, 
a it-it stacking interaction between a guanine nucleotide in 
R12 and an indole ring of tryptophan in PI 6 (see Figure 
2) has been proposed as another important driving force 
(5). Taken together, the structural data have shown that pre- 
cise stacking of flat moieties, specific hydrogen bonding and 
electrostatic complementarity frequently occur in the com- 
plexes (2). 

The determination of the complex structure is an indis- 
pensable starting step toward the clarification of the driv- 
ing force for the binding. However, it is not sufficient and 
to be followed by studies from the viewpoint of statistical 
thermodynamics of hydration. This is because the binding 
occurs in aqueous solution and water should play impera- 
tive roles. There are two principal points to be taken into ac- 
count as illustrated in Figure 3. In the figure, the two solute 
molecules are rigid and their structures remain unchanged 
upon binding in (a), whereas one of them is structureless 
and becomes structured upon binding in (b). Firstly, the 
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Figure 2. Structural images of the upper half of the R12:P16 complex. 
It is taken from an NMR structure (model 1) constructed as described in 
'Structure Modeling for R12:P 16 Complex'. R12 and P16 are drawn on the 
upper and lower sides, respectively, using the space-filled model, (a) Con- 
tact of a phosphate group in R12 with a lysine residue in PI 6. The phos- 
phate backbone (CS'-C^-CS'-PO 4- ) and Lys8 are depicted by the licorice 
model, (b) tt-tt stacking of a guanine nucleotide in R12 and an indole ring 
of tryptophan in P16. The quadruplex tetrad plane containing G:G:G:G 
(G denotes a guanine nucleotide) and Trp3 are depicted by the licorice 
model. The water molecules are omitted here. This figure is drawn using 
the VMD 



two molecules are hydrated, and the regions that become 
unexposed to water upon binding undergo dehydrations. 
When a solute region is hydrated, unless the region is pre- 
dominantly nonpolar, the water structure near the region 
is significantly perturbed due to solute-water electrostatic 
and van der Waals attractive interactions. Upon dehydra- 
tion, the attractive interactions are lost, giving rise to an 
energy increase. At the same time, the water structure is 
reorganized and the water-water electrostatic and van der 
Waals attractive interactions are recovered, leading to an 
energy decrease. The energy increase is approximately two 
times larger than the energy decrease (26,27) and the net 
change in energy is positive and considerably large, which is 
referred to as 'energetic dehydration effect' hereafter. Since 
a hydrogen bond is usually expressed as an electrostatic at- 
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Figure 3. Cartoons illustrating the dehydration and the overlap of the ex- 
cluded spaces accompanying the binding of two solute molecules, (a) Case 
where they are both rigid and their structures remain unchanged, (b) Case 
where one of them is flexible (i.e. structureless) and becomes structured 
after the binding. 



tractive interaction between hydrogen and oxygen or ni- 
trogen, we consider that solute-solute, solute-water and 
water-water hydrogen bonds can be regarded as electro- 
static attractive interactions. Secondly, each molecule gener- 
ates an excluded space where the centers of water molecules 
cannot enter. The volume of the excluded space is referred 
to as the excluded volume (EV). Upon binding, the two ex- 
cluded spaces overlap and the total EV decreases by the 
overlapped volume as shown in Figure 3a, which is followed 
by a corresponding increase in the total volume available 
to the translational displacement of water molecules in the 
system. Hence, the binding leads to a large gain of the con- 
figurational entropy of water. When the structure of one of 
the molecules changes to a more compact one upon binding 
as in the case of Figure 3b, the water-entropy gain becomes 
considerably larger than in the case of Figure 3a, because 
such a structural change results in a larger decrease in the 
total EV. We have shown that the physical factors described 
above, which can be categorized as 'entropic EV effect', play 
essential roles in a variety of biological self-assembly pro- 
cesses (28-32). 

In the present study, we revisit the R12-P16 binding from 
the viewpoint of statistical thermodynamics of hydration. 
Since PI 6 does not take a particular tertiary structure when 
it is isolated, the binding is accompanied by a global struc- 
tural change of PI 6. We remark that obtaining only mi- 
croscopic information using a computer (e.g. molecular dy- 
namics) simulation does not yield the clarification of the 



driving force. Calculating the changes of thermodynamic 
quantities upon binding is vitally important. We calculate 
the binding free energy using the molecular mechanics, hy- 
brid method (30-37) in which the angle-dependent integral 
equation theory (ADIET) (38^2) applied to a multipo- 
lar water model (38,39) is combined with the morphome- 
tric approach (MA) (43,44) and 3D reference interaction 
site model (3D-RISM) theory (45-48). The ADIET and 
the 3D-RISM theory are statistical-mechanical theories for 
molecular liquids and hydration of solutes. The combina- 
tion with the MA has recently been developed by us so that 
the ADIET can be applied to a complex biomolecule with 
drastic reduction in the computational load. The hybrid 
method and the 3D-RISM theory, which have been quite 
successful in analyzing a variety of self-assembly processes 
(27-32,49-52), are best suited to calculations of the hydra- 
tion energy and entropy, respectively. 

Our important findings are as follows. The binding is 
driven by a large gain of the water entropy; and the en- 
ergy decrease due to the gain of R12-P16 attractive interac- 
tions is almost canceled out by the energy increase originat- 
ing from the energetic dehydration effect upon binding. It 
should be noted, however, that the gain of R12-P16 attrac- 
tive interactions is necessitated for compensating for the en- 
ergetic dehydration effect. We point out that the general ex- 
perimental observations manifesting precise stacking of flat 
moieties, electrostatic complementarity, including specific 
hydrogen bonding, and molecular shape complementarity 
can reasonably be interpreted. Further, we argue that the 
geometric characteristics (overall shapes, sizes and detailed 
polyatomic structures) of the solute molecules become the 
most important factors when the entropic EV effect domi- 
nates. This argument sheds new light on the elucidation of 
all types of molecular-recognition mechanisms within the 
same theoretical framework in a unified manner. 

MODEL AND THEORY 

Free-energy function 

For a solute molecule immersed in water at infinite dilution, 
we define the free-energy function G expressed as 



G = E c - TSq + /x H , 



(1) 



where Eq and Sq are the conformational (intramolecular) 
energy and entropy of the solute molecule, respectively, T 
is the absolute temperature and /xh is the hydration free 
energy defined as the excess chemical potential of the so- 
lute. We note that /x H is independent of the solute insertion 
condition, isobaric or isochoric (53), and we consider iso- 
choric condition that is much more convenient in a theoret- 
ical treatment. Using the relation, 

Mh = £vh - TSyn, (2) 

where the subscript 4 VH' denotes the hydration under iso- 
choric condition, €vh an <3 Svh are the hydration energy and 
entropy, respectively, Equation (1) is rewritten as 



Eq — TSq + £VH — TSyft. 



(3) 



Here, €vh represents the solute-water interaction energy 
generated plus the energy change due to the structural re- 
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organization of water upon solute insertion and S V h repre- 
sents the change in the water entropy upon solute insertion. 
G is also independent of the solute insertion condition. 

E c is calculated on the basis of a molecular mechanical 
potential and thereby decomposed into the bonded, non- 
electrostatic and electrostatic parts as 

E C = E B + E U + E ES , (4) 

where E B is the bond-type energy comprising the bond- 
stretching, angle-bending and torsion-angle energies, Eu is 
the Lennard- Jones (LJ) interaction energy and E E s is the 
electrostatic interaction energy. We also decompose € V h as 

£VH = £VH,LJ + £VH,ES, (5) 

where €vh, lj and €vh, es are the nonelectro static and elec- 
trostatic contributions to 6vh ? respectively. The decomposi- 
tion is made in the following manner. First, we calculate the 
hydration energy of a hypothetical solute molecule whose 
partial charges are all switched to zero, €vh, lj- Then, we 
obtain e V h, es from e V h, es = ^vh - ^vh, lj. 

Substituting Equations (4) and (5) into Equation (3) 
yields 

G = £ t otai - TSc - TSyn, (6a) 

^total = Eb + (E U + £VH,Ll) + (^ES + ^VH,Es). (6b) 

^totah which is referred to as the total energy hereafter, con- 
sists of E B and the LJ and electrostatic components denoted 
by E u + 6 V h, lj and E ES + e V H, es, respectively. 



Paths for calculating binding free energy 

The free-energy function and its constituents defined above 
are applicable to R12, P16 and the R12:P16 complex. Fig- 
ure 4 shows a schematic representation of the paths con- 
sidered for calculating the binding free energy, the free- 
energy change upon the R12-P16 binding. In path I, R12 
and PI 6 in the complex are simply separated with no struc- 
tural changes, and the resulting structures are employed 
as the isolated molecules. That is, it is assumed that both 
of R12 and PI 6 isolated possess unique structures with 
no structural fluctuations. In the real system, isolated PI 6 
takes an ensemble of rather extended structures and un- 
dergoes a global structural change upon binding though 
R12 exhibits almost no change in its structure (5,23). This 
binding process is mimicked by path II. Path III represents 
the structural change of PI 6 from the unstructured state 
(P16 ran dom coils- it is assumed to be a set of random coils in 
the present study) to the compact structure taken from the 
complex (P16 compact ). 

Let A M X{M = I, II, III) denote the change in X for path 
M. The binding free energies, which can be considered for 
paths I and II, are given by A 1 G and A 11 G, respectively. They 
are formally expressed as 

A l G = G(R12 : P16) - {G(R12) + G(P16 compact )} 

= A^lj + £ V H,LJ) + A^es + £vh,es) " TA 1 S\m,(7) 



P1 6(Compact) P1 6(Unstructured) 




R12 R12 




P16(Compact) -R12 

Figure 4. Schematic representation of the paths considered for calculating 
the binding free energy, the free-energy change upon the R12-P16 binding. 
The R12:P16 complex indicated by 'P16(Compact)»R12' is taken from an 
NMR structure (model 1) constructed as described in 'Structure Model- 
ing for R12:P16 Complex'. R12 and PI 6 are drawn on the upper and lower 
sides, respectively, using the space-filled model. A 1 G, A U G and A 1U G de- 
note changes in the free-energy function along paths I, II and III, respec- 
tively. The water molecules are omitted here. This figure is drawn using the 
VMD. 

A n G = G(R12 : P16) - {G(R12) + G(P16 randomcoils )} 

= A U E B + A n (£ LJ + £ V H,LJ) + A n (£ ES + £ V H,ES) 

-TA ll Sc- rA n 5v H . (8) 

G(R12), for example, is the free-energy function for R12. 
Z^i^ES? ^^vh, lj and ^^vh, es ? for example, originate from 
electrostatic attractive interactions between R12 and PI 6, 
from the energetic dehydration effect due to the loss of R12- 
water and P16-water van der Waals attractive interactions 
and from the energetic dehydration effect due to the loss 
of R12-water and P16-water electrostatic attractive interac- 
tions, respectively, and ^^vh represents the water-entropy 
gain in the binding process of path I. We remark that about 
half of the energy increase arising from the loss of R12- 
water and P16-water attractive interactions is canceled out 
by the energy decrease due to the structural reorganization 
of water upon binding (see the fourth paragraph of the In- 
troduction section). In path I, A l E B = 0 and Z^Sc = 0. The 
free-energy change due to the global structural change of 
PI 6, A lll G, is given by 

A IH G = G(P16 com pact) — G(P16 ran domcoils). 

= A lU E B + A m (£ LJ + £ V H,LJ) + A m (£ ES + £ V H,ES) 

-TA ul S c -TA m S YH . (9) 

It should be noted that A ll G = A l G + A lll G and A ll S c = 
A™S C . 
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Since P16 ran dom coils is more stable than P16 com pact, A m G 
> 0. A ll G < 0 and it follows that A l G < 0 and A l G > 
A ll G. A ll S c < 0 and -TA ll S c ( = -TA m S c ) cannot be 
a driving force for the binding. A 11 Eb ( = A m E^) should 
have a relatively small contribution to the binding free en- 
ergy. We present the theoretical results for A ll G and A m G 
by omitting — TA ll Sc for the following reasons: the calcu- 
lation of conformational entropy is a formidable task and 
modeling the unstructured state of isolated PI 6 is accompa- 
nied by significant uncertainty. However, the validity of our 
theoretical method can be checked in the following man- 
ner. The dissociation constant for the R12-P 16 binding was 
determined by Mashima et al. (5) through a filter binding 
assay experiment by assuming that PI 6 bound to an R12 
monomer. However, R12 is present as a dimer and two PI 6s 
bind to the R12 molecules forming the dimer, respectively 
(see Figure 1). In the present study, we recalculate the dis- 
sociation constant (the recalculated one is denoted by k^>) 
as described in Section A of Supplementary Appendices 
and obtain an approximate value of the binding free energy 
corresponding to A ll G as — RTin(kv) that is denoted by 
^ n ^experimental (R is the gas constant). The theoretical value 
of A ll G in which — TA u Sq is omitted, ^ 11 (^theoretical, must 
be sufficiently lower than ^(^experimental- This criterion is 
certainly satisfied as shown in the Results and Discussion 
section (— TA u Sq is roughly estimated and a discussion is 
also made on the quantitative aspect of the theoretical re- 
sults). 

Our principal concern is to examine the signs and mag- 
nitudes of A M (E U + € VH ,lj), A M (E ES + evH, ES ) and 
— TA M Syn (M = I, II) which enable us to specify the driving 
force. Those for path I may be better suited to quantitative 
analyses because the binding free energy for path I given by 
Equation (7) does not include changes in the bond-type en- 
ergy and the solute conformational entropy. 

Calculation of hydration entropy 

The ADIET (38^2) applied to a multipolar model for wa- 
ter (38,39) has been quite successful in studies on hydropho- 
bic and hydrophilic hydrations. In the theory, the orienta- 
tional dependency of the water-water and solute-water in- 
teraction potentials and correlations is explicitly taken into 
consideration. In most of our studies, a water molecule is 
modeled as a hard sphere with diameter d$ = 0.28 nm in 
which a point dipole and a point quadrupole of tetrahedral 
symmetry are embedded. The effect of the molecular polar- 
izability is taken into account using the self-consistent mean 
field (SCMF) theory (38,39). At the SCMF level, the many- 
body-induced interactions are reduced to pairwise additive 
potentials involving an effective dipole moment. The effec- 
tive dipole moment thus determined is about 1.42 times 
larger than the bare gas-phase dipole moment. The dielec- 
tric constant of bulk water, which is a good measure of the 
validity of a theory, is calculated to be ~83 that is in good 
agreement with the experimental value ~78. Further, the 
theory is capable of reproducing the solubility of methane 
featuring the minimum at about 350 K (41). The hydration 
free energy of a spherical, nonpolar solute calculated is in 
perfect agreement with that obtained by computer simula- 
tions using more popular water models such as the trans- 



ferable intermolecular potential four point (TIP4P) and the 
extended single point charge (SPC/E) (41). A more detailed 
description of the ADIET is given in Section B of Supple- 
mentary Appendices. 

It is not straightforward to apply the ADIET to a large, 
complex solute with polyatomic structure due to the mathe- 
matical complexity. Fortunately, as far as the hydration en- 
tropy SVh is concerned, it has been shown to be rather in- 
sensitive to the solute-water interaction potentials. For ex- 
ample, Imai et al (47) examined S V h for a total of eight 
peptides and proteins using the 3D-RISM theory combined 
with the all-atom potentials and found the following: even 
when the protein-water electrostatic interactions, which are 
quite strong, are completely shut off and only the LJ inter- 
actions are retained, SVh changes merely by less than 5%. 
Thus, only the geometric characteristics are essential in the 
calculation of SVh, and a solute molecule can be modeled 
as a set of fused, neutral hard spheres. The diameter of an 
atom in a solute molecule is set at the corresponding a -value 
for the LJ potential. As a consequence, the MA (43,44) be- 
comes a very powerful tool. We employ the hybrid method 
in which the ADIET is combined with the MA for calculat- 
ing SVh- 

The idea of the MA is to express SVh by the linear combi- 
nation of only four geometric measures of a solute molecule: 

SvhAb = Q F ex + C 2 A+ C 3 X+ C 4 Y. (10) 

Here, Equation (10) is referred to as the morphometric 
form, V QX is the EV generated by the solute molecule, A is 
the water-accessible surface area, Xand Tare the integrated 
mean and Gaussian curvatures of the water-accessible sur- 
face, respectively, and k# is the Boltzmann constant. The 
water-accessible surface is the surface that is accessible to 
the centers of water molecules. The EV, as defined in the pre- 
vious section, is the volume that is enclosed by this surface. 
We note that C\ is completely independent of the solute- 
water interaction potential. Though SVh is influenced by 
all the four terms, C\ V ex is the principal term at normal 
temperature and pressure. This is the reason for the fair in- 
sensitivity of SVh to the solute-water interaction potential. 
The contribution from the water molecules near the solute 
molecule is represented by the other three terms. In the MA, 
the solute shape enters SVh only via the four geometric mea- 
sures. Therefore, the four coefficients (Q-C4) can be deter- 
mined in simple geometries: they are calculated from the 
values of *SVh of hard-sphere solutes immersed in our model 
water. The ADIET is employed in the calculation. The four 
coefficients are determined by the least square fitting ap- 
plied to the following equation (i.e. Equation 10 applied to 
hard-sphere solutes): 

S\h/ kn = 

Ci(4tt7? 3 /3) + C 2 (47tR 2 ) + C 3 (4n R) + C 4 (4tt), (11) 
R = (du + d s )/2. 

Here, d\j denotes the hard-sphere diameter and sufficiently 
many different values of d\j are considered. T is set at 298 
K, and the number density of bulk water ps is taken to be 
that of real water on the saturation curve, ps = 0.0333 A -3 . 
Once the four coefficients are determined, S V h of the solute 
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molecule with a prescribed structure is obtained from Equa- 
tion (10) only if the four geometric measures are calculated. 

The high reliability of the hybrid method in calculat- 
ing Syn has been demonstrated in the following examples 
wherein the water-entropy effect is treated as the key fac- 
tor: quantitative reproduction of the experimentally mea- 
sured changes in thermodynamic quantities upon apoplas- 
tocyanin (apoPC) folding (30); elucidation of the micro- 
scopic mechanisms of pressure (33) and cold (34,35) denat- 
urating of a protein; and proposal of a reliable measure of 
the thermodynamic stability of a protein (36,37). 



Calculation of hydration energy 

The hydration energy 6vh ? which is largely dependent on 
the solute-water interaction potential, cannot be calculated 
by the hybrid method described above. We employ the 3D- 
RISM theory (45-48) for the calculation of e V H. The LJ po- 
tential parameters and partial charges are assigned to the 
atoms constituting the solute molecules. The 3D-RISM the- 
ory has successfully been applied to important problems 
in biological systems such as the hydration properties of 
peptides and proteins (27,47), receptor-ligand binding pro- 
cesses (49,50) and association of protein molecules (51,52). 
However, the theory is not very good at elucidating the hy- 
dration of a hydrophobic molecule. For instance, the the- 
ory underestimates the absolute value of the water-entropy 
change upon protein folding though the hydration-energy 
change remains quantitatively reliable (27). Taken together, 
the 3D-RISM theory and the hybrid method are best suited 
to calculations of €vh and SVh, respectively. 

A description of the 3D-RISM theory is provided in Sec- 
tion C of Supplementary Appendices. First, the site-site 
correlation functions for bulk water are calculated using 
the dielectrically consistent RISM (DRISM) theory (54,55) 
coupled with the Kovalenko-Hirata (KH) closure equation 
(46). The site-site intermolecular potentials, water density 
(p s = 0.0333 A" 3 ) and absolute temperature (T = 298 K) 
are served as the input data. The SPC/E model (56) is em- 
ployed for water with a correction in terms of the LJ po- 
tential parameters for the hydrogen sites (a = 0.654 A, 
6 = 0.0155 kcal/mol). The water-solute correlation func- 
tions are then obtained for a solute molecule with a pre- 
scribed structure by solving the 3D-RISM/KH equations. 
The site-site correlation functions for bulk water and the 
water-solute interaction potentials form the input data. The 
LJ potential parameters and partial charges for the solute 
atoms are taken from the standard Amber99SB force field 
(57). The solution is performed on a 3D cubic grid. The 
grid spacing (Ax, Ay and Az) is set at 0.5 A, and the grid 
resolution (N x xN y xN z ) is 256 x 256 x 256. It has been 
confirmed that the spacing is sufficiently small and the box 
size (N x Ax, N y Ay, N z Az) is large enough for the result ob- 
tained to be identical within convergence tolerance. The hy- 
dration free energy /xh is calculated using the 3D extension 
of the Singer-Chandler formula adapted to the KH closure 
equation (46,58). Finally, €vh is obtained from e V H = /^h + 
TSyn, where SVh is numerically evaluated as the tempera- 
ture derivative of /x H - 



Structure modeling of R12:P16 complex 

The ten 3D structures of the 2xR12:2xP16 complex were 
taken from the Protein Data Bank (PDB code: 2RSK). 
They were determined by Mashima et al (5) using a simu- 
lated annealing protocol combined with the interatomic dis- 
tance and dihedral angle constraints obtained from the nu- 
clear magnetic resonance (NMR) experiments. In the simu- 
lated annealing protocol, the a -values in the LJ potentials 
of the Amber99SB force field (57) were multiplied by 0.7 to 
widely explore the structural space for the complex. 

We employ an advanced molecular mechanics refinement 
procedure using the Amber99SB force field with the full o- 
values. This procedure, which utilizes the generalized Born 
and surface area (GB/SA) model solvent (59,60), is com- 
bined with the interatomic distance constraints obtained 
from the nuclear Overhauser effect spectroscopy (NOESY). 
NOESY intensities are taken into account as the limits of 
interatomic distances. These limits are about 1.4 ( = 1/0.7) 
times larger than those employed by Mashima et al in the 
previous study (5). The ten structures are refined with 100 
steps of the steepest-descent energy minimization followed 
by the conjugated gradient minimization which is contin- 
ued until the root mean square forces acting on the com- 
plex atoms become weaker than 1.0 x 10~ 4 (kcal/mol)/A. 
All the calculations are carried out using the AMBER 12 
molecular dynamics package (61). 

Although the effect of water is further taken into account 
in the present refinement procedure, the important struc- 
tural features of the 3D structures determined by Mashima 
et al (5) are conserved quite well. Example common fea- 
tures are the stacking between a guanine nucleotide in R12 
and an indole ring of tryptophan in PI 6 and the contact of 
phosphate groups in R12 with lysine residues in PI 6 (see 
Figure 2). 

Since the upper and lower halves of the 2xR12:2xP16 
complex share almost the same 3D structure, the upper half 
is taken from each complex structure thus constructed and 
used as the structure of the R12:P16 complex. We calculate 
the free-energy function where the conformational entropy 
is set at zero for each model structure of the R12:P16 com- 
plex. The ten models obtained are ranked in ascending or- 
der with respect to this free-energy function, and the first to 
fifth models (models 1 through 5) are chosen for the present 
study. The coordinates of the ensemble of five structures 
thus obtained for the 2xR12:2xP16 complex have been de- 
posited in the Protein Data Bank with the accession code 
2RU7. 

Structure modeling of P16 in unstructured state 

The unstructured state of PI 6 is modeled as a set of random 
coils. First, PI 6 is taken from the R12:P16 complex and its 
structure is refined with 100 steps of the steepest-descent 
energy minimization followed by the conjugated gradient 
minimization with the GB/SA model solvent which is con- 
tinued until the root mean square forces acting on the PI 6 
atoms become weaker than 1.0 x 10~ 4 (kcal/mol)/A. Then, 
starting from the structure thus obtained, we assign ran- 
dom numbers to the dihedral angles for the backbone chain, 
<P and ^. The random numbers are limited to the range 
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from -180° to -30° for <£> and from -180° to -150° and 
from —90° to 180° for ^ which correspond to the allowed 
regions in the Ramachandran map. For glycine, 0 is dis- 
tributed from -180° to -30° and from 30° to 180°, and V 
is from -180° to 180°. For proline, 0 is set at -65° and ^ 
is set at 180°. The 200 models, which are obtained in this 
way, are refined in accordance with the minimization tech- 
niques described above. The models that give divergently 
high LJ potential energies due to the overlap of the con- 
stituent atoms are then excluded, and the remaining 103 
models are employed as the random coils. All calculations 
are carried out using the AMBER 12 molecular dynamics 
package (61). The average value of the free-energy function 
for these models is regarded as G(P16 ran dom coils)- 

RESULTS AND DISCUSSION 

Gain of water entropy upon binding: driving force 

Table 1 gives the binding free energy and its two compo- 
nents, total-energy change and contribution from the water- 
entropy gain, upon the R12-P16 binding in path I (see Fig- 
ure 4). The binding free energy, which is in the range from 
—47.05 to —30.31 kcal/mol, is governed by the contribu- 
tion from the water-entropy gain for all the five models of 
the complex structure. The total-energy change is positive 
and against the binding for models 1, 3 and 4. It is nega- 
tive and in favor of the binding for models 2 and 5, but the 
absolute value of the contribution from the water-entropy 
gain is roughly an order of magnitude larger than that of 
the total-energy change. We can conclude that the binding 
is driven by the water-entropy gain. 

Table 2 gives the binding free energy and its two compo- 
nents upon the R12-P16 binding in path II. Unlike in path 

I, the structural change of PI 6 accompanying the binding 
is taken into account in path II. The binding free energy 
is in the range from —23.75 to —4.59 kcal/mol. The total- 
energy change takes a considerably large, positive value for 
all the five models of the complex structure, and it strongly 
opposes the binding. However, the water-entropy gain sur- 
passes the increase in the total energy and drives the bind- 
ing. Physicochemical interpretations of the results displayed 
in the two tables are provided in later sections, nullnull 

Comparison between theoretical and experimental values of 
binding free energy 

We compare the binding free energy calculated for path 

II, ^(/theoretical {A ll G in Table 2), with the experimen- 
tally determined value, 11 (^experimental- ^"^experimental can 
be obtained as — RTln(kv) where k^> is the dissociation 
constant for the R12-P16 binding: ^(^experimental = —6.45 
kcal/mol (see Section A of Supplementary Appendices). In 
a strict sense, ^ 11 (^theoretical cannot be in complete agree- 
ment With ^ H ^experimental - Tnis is because for ^"^experimental 

= — R7ln(kv) the standard state is 1 mol/1 and the activ- 
ity coefficient is set at unity while in the theoretical calcula- 
tion the standard state is the infinite dilution (62). Neverthe- 
less, the two values should be close to each other. As stated 
above, A 11 G m eoreticai in which — TA u Sq is omitted must be 
sufficiently lower than ^(^experimental- This requirement is 
certainly met except in the case of model 4. 



Here, it is worthwhile to roughly estimate — TA ll Sc- 
Since R12 undergoes essentially no structural change 
upon the R12-P16 binding, A u Sq is almost equal to the 
conformational-entropy loss for PI 6 caused by the transi- 
tion from the random-coil state to the compact structure, 
A lll Sc- Fitter (63) studied the conformational-entropy loss 
upon protein folding using neutron spectroscopy. He esti- 
mated the temperature dependence of the radius parameter 
r representing a length scale within which each residue can 
freely move. The loss is given by —?>k^N v \n{r u /rf) where r u 
and rf are the radius parameters for the unfolded and folded 
states, respectively, and N r is the number of residues. The 
estimation of the radius parameters was made at three tem- 
peratures: 303, 323 and 343 K. In order to obtain the loss 
at 298 K, we first performed the linear fitting of the tem- 
perature dependence of r u and rf, and then calculated their 
values at 298 K. The loss for PI 6 with N r = 12 was thus ob- 
tained: -TA ll S c = -TA lll S c ~ 8.83 kcal/mol. The neu- 
tron scattering experiments cover only the picosecond time 
regime although the fluctuations in other time scales also af- 
fect the loss (63). For this reason, our method based on Fit- 
ter's experiments tends to give significant underestimation 
of the loss. However, PI 6 possesses two proline residues, 
making its random coils rather compact, with the result of 
an exceptionally small loss. For this reason, it may be justi- 
fied to adopt the above result as a rough estimation. 

If the contribution from the conformational-entropy loss, 
— TA ll Sc = 8.83 kcal/mol, is incorporated in the theoret- 
ical calculation for the binding free energy, the result is in 
the range from —14.92 to 4.24 kcal/mol. The average value 
is —6.29 kcal/mol which is comparable with ^(^experimental 
= —6.45 kcal/mol. We can conclude that our theoretical re- 
sults are reliable even in a quantitative sense. (Model 4 might 
be inappropriate because it gives the binding free energy a 
positive value when -TA u Sq is incorporated.) 

Decomposition of total-energy change into various compo- 
nents 

The total-energy change upon the R12-P16 binding in path 
I is decomposed into A l E L j, A l €y^ lj, ^^es and ^^vh, es 
as displayed in Table 3. A l Eu and A l €yu,u are both sig- 
nificantly large, but the former is negative while the latter 
is positive, A l E^s and ^^vh, es are even much larger, but 
the former is negative while the latter is positive. A l E^j and 
^Wh, u or A l E E $ and ^evH, es are compensating. Thus, 
the binding gives rise to an energy decrease arising from van 
der Waals attractive interactions between R12 and PI 6, but 
it is accompanied by an energy increase related to the loss 
of R12-water and P16-water van der Waals attractive inter- 
actions. The former is slightly larger and A l (E^j + ^vh, lj) 
takes a rather small, negative value. The binding brings an 
energy decrease arising from electrostatic attractive inter- 
actions between R12 and PI 6, but it is accompanied by 
an energy increase related to the loss of R12-water and 
P16-water electrostatic attractive interactions. The latter is 
slightly larger and A l (E^s + ^vh, es) takes a rather small, 
positive value. As a matter of fact, the loss of R12-water 
and P16-water attractive interactions is quite large. How- 
ever, about half of it is canceled out by the water reorga- 
nization energy (26,27) (i.e. the energy lowering due to the 
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Table 1. Binding free energy and its energetic and entropic components (in kcal/mol) upon the R12-P16 binding in path I 



Model A l G ^totai -TA I Syu 



1 -36.16 5.47 -41.63 

2 -39.80 -3.10 -36.71 

3 -35.84 0.14 -35.98 

4 -30.31 3.43 -33.73 

5 -47.05 -6.58 -40.47 



Five different models, which correspond to the upper halves of the 2xR12:2xP16 complex structures in the Protein Data Bank with the accession code 
2RU7, are considered for the structure of the R12:P16 complex. 

Binding free energy, A l G = G(R12:P16) - {G(R12) + G(P16 compact )} = A l E total - TA l S YH . 
Total-energy change, Z^totai = A l (E u + <?vh, lj) + A l (E E s + 6 V h, es). 
Contribution from the water-entropy gain, — TA l S\n. 

Table 2. Binding free energy and its energetic and entropic components (in kcal/mol) upon the R12-P16 binding in path II 



Model A n G ^ n ^totai -TA u S Y h 



1 -21.67 36.26 -57.93 

2 -14.04 40.32 -54.36 

3 -23.75 27.43 -51.18 

4 -4.59 45.10 -49.70 

5 -11.53 48.84 -60.37 



Five different models, which correspond to the upper halves of the 2xR12:2xP16 complex structures in the Protein Data Bank with the accession code 
2RU7, are considered for the structure of the R12:P16 complex. 

Binding free energy, A ll G = G(R12:P16) - {G(R12) + G(P16 ran dom coils)}, where -TA ll S c is omitted: A ll G = A ll E totSi i - TA ll Syn. 
Total-energy change, A u E m ^ = A U E B + A U (E LJ + 6 V h, lj) + ^ n (^ES + ^vh, es)- 
Contribution from the water-entropy gain, — TA ii S\yl. 



structural reorganization of water upon binding) with the 
result that the total loss becomes comparable with the gain 
of R12-P16 attractive interactions. A l (E^j + ^vh, lj) and 
A 1 (Ees + £vh,es) are also compensating, leading to a con- 
siderably small total-energy change (it is either positive or 
negative), nullnull 

The energy decrease due to the gain of R12-P16 attrac- 
tive interactions is almost canceled out by the energy in- 
crease caused by the energetic dehydration effect. The bind- 
ing is driven by the entropic EV effect with the help of the 
gain of R12-P16 attractive interactions compensating the 
dehydrations. A loss of the conformational entropy of PI 6 
opposes the binding, but this effect is much smaller than the 
driving force, i.e. the water-entropy gain. 

Global structural change of P16 upon binding 

Table 4 gives the free-energy change, total-energy change 
and contribution from the water-entropy gain arising from 
the global structural change of PI 6 referred to as path III 
(see Figure 4). The reason for the significantly large total- 
energy change can be summarized as follows. The energy 
decrease, which arises from intramolecular van der Waals 
attractive interactions brought by the structural change, is 
somewhat compensated by the energy increase related to 
the loss of P16-water van der Waals attractive interactions. 
As a consequence, A m (E^j + €vh, lj) takes a rather small, 
negative value. By contrast, the energy increase related to 
the loss of P16-water electrostatic attractive interactions is 
considerably larger than the energy decrease arising from in- 
tramolecular electrostatic attractive interactions brought by 
the structural change. As a consequence, A ui (Ees + ^vh, es) 
takes a considerably large, positive value, giving rise to the 
significantly large total-energy change. The water-entropy 



gain contributes to the free-energy change as a negative 
component, but this effect is smaller than the positive total- 
energy change. Thus, the free-energy change takes a signif- 
icantly large, positive value. It increases further when the 
conformational-entropy loss, — TA m Sc = 8.83 kcal/mol, is 
incorporated. The positive free-energy change in path III is 
consistent with the experimental observation that isolated 
PI 6 is characterized by no well-defined structure, nullnull 

The large electrostatic dehydration effect can be in- 
terpreted as follows. The amino-acid sequence of PI 6 is 
Gly-Gln-Trp-Asn-Lys-Pro-Ser-Lys-Pro-Lys-Thr-Asn. Ex- 
cept for Gly, PI 6 possesses 11 side chains. Three of them 
(three lysine residues) are positively charged, six of them 
(Gin, Trp, Asn, Ser, Thr and Asn) are polar and only two 
of them (two proline residues) are nonpolar. PI 6 is thus 
highly hydrophilic and substantially more stabilized when 
the water-accessible surface area is large, i.e. it is rather ex- 
tended. 

Physical origins of water-entropy gains upon R12-P16 bind- 
ing 

The principal term in the right hand of Equation (10) is 
the first one dependent on the EV at normal temperature 
and pressure. The water-entropy gain upon the R12-P16 
binding can be discussed by decomposing it into the water- 
entropy gains accompanying paths III and I in Figure 4: 
(III) the global structural change of PI 6 from a random- 
coil state to the compact structure and (I) the contact of 
PI 6 possessing the compact structure with R12. In path 
III, many of the atoms in PI 6 contact one another, lead- 
ing to a large decrease in the EV. In path I, the atoms of 
PI 6 and R12 at their interface are closely packed, leading 
to an even larger decrease in the EV. The EV decreases by 
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Table 3. Components of total-energy change (in kcal/mol) upon the R12-P16 binding in path I 



Model 


^total 


A l {E u + 

^VH, Lj) 


A\E ES + 

^VH, ES) 




^VH, LJ 


A l E E s 


^VH, ES 


1 


5.47 


-3.94 


9.41 


-54.92 


50.98 


-1048.45 


1057.85 


2 


-3.10 


-6.33 


3.23 


-52.84 


46.51 


-1017.00 


1020.23 


3 


0.14 


-2.05 


2.19 


-50.55 


48.50 


-1070.42 


1072.61 


4 


3.43 


-5.04 


8.47 


-50.81 


45.77 


-1001.51 


1009.98 


5 


-6.58 


-9.55 


2.97 


-53.50 


43.95 


-1021.89 


1024.85 



Five different models, which correspond to the upper halves of the 2xR12:2xP16 complex structures in the Protein Data Bank with the accession code 
2RU7, are considered for the structure of the R12:P16 complex. 

Total-energy change, A 1 E ma \ = ^C^u + <?vh, lj) + ^(^es + ^vh, es), comprises A 1 E U , ^vh, lj, ^^es and ^vh, es- 

^C^lj + £vh, lj) is the LJ component. 

^C^es + <?vh, es) is the electrostatic component. 

A l E u , 

^^VH,LJ, 

A 1 Ees, and 

^^vh, es originate from van der Waals attractive interactions between R12 and P16, from electrostatic attractive interactions between R12 and P16, from 
the energetic dehydration effect related to the loss of R12-water and P16-water van der Waals attractive interactions and from the energetic dehydration 
effect related to the loss of R12-water and P16-water electrostatic attractive interactions, respectively. The contribution from the structural reorganization 
of water upon binding is included in Z^vh, lj and A l e\Yi, es- 

Table 4. Free-energy change and its energetic and entropic components (in kcal/mol) due to the global structural change of PI 6 upon the R12-P16 binding 
referred to as path III 



Model A lll G ^ m ^totai — TA lll Syu 



1 14.49 30.79 -16.30 

2 25.76 43.42 -17.65 

3 12.09 27.29 -15.19 

4 25.71 41.68 -15.96 

5 35.52 55.42 -19.90 



Five different models are considered for the structure of P16 com pact (P Incompact is taken from the R12:P16 complex). 

Free-energy change due to the global structural change of PI 6 upon the R12-P16 binding, A m G = G(P16 compact ) - G(P16 randomcoi i s ), where -TA m S c ( 
= -TA U S C ) is omitted: A m G = A m E totid -TA m Syn. 

Total-energy change, A m E toVdl = A m E B + A m (E u + 6 VH , lj) + A m (E ES + ^vh, es). 
Contribution from the water-entropy gain, — TA m Syn. 



12.49<is 3 (^s is the molecular diameter of water, 0.28 nm) 
in path III and by 33.33<is 3 in path I. These are the aver- 
age values for the five different models of the R12:P16 com- 
plex. Since A m V QX /d s 3 = -12.49, A l V QX /d s 3 = -33.33 and 
C\ <0, the R12-P16 binding accompanied by the structural 
change of PI 6 leads to a large gain of the water entropy. 

Changes in hydration entropy and enthalpy under isobaric 
condition 

The theoretical calculations are made under isochoric con- 
dition while the experiments are performed under isobaric 
condition. The changes in hydration quantities under the 
two conditions are related to one another as (53,64) 

AH/(k B T) = AW(M + (a*/K*)AV m /dl, (12a) 
AS m /k B = AS YH /k B + (a*/K* T )AV m /dl (12b) 

A m /(k B T) = Ae YH /(k B T) - ASVhMb = 
AH/(k B T)-AS m /k B . (13) 

Here, the subscript 'PH' denotes the hydration under iso- 
baric condition, H is the hydration enthalpy, A Fph is the 
system-volume change and d$ is the molecular diameter of 



water, a* and kt*, which depend only on the properties of 
pure water, are defined as 

a*=aT, (14a) 

K* T = K T k B T/dl (14b) 

where a is the isobaric thermal expansion coefficient and k t 
is the isothermal compressibility. A Fph can be calculated 
by the 3D-RISM theory (45-48). The values obtained are 
A V m /d s 3 = 5.51, 5.47 and -0.04 in paths I, II and III, re- 
spectively. k B T(a*/K T *)A Vvn/ds 3 in path I, for example, is 
only 2.92 kcal/mol that is far smaller than A l Syn, ^^vh, lj 
and ^^vh.es- Thus, the changes in hydration quantities 
under isobaric condition are almost indistinguishable from 
those under isochoric condition. 

It is worthwhile to comment on a general case. Upon 
binding of a hydrophobic receptor and a hydrophobic lig- 
and under isobaric condition, for example, the compression 
of bulk water occurs, possibly leading to a water-entropy 
loss (31): due to considerably large, negative AV?h, ASpyl 
can become negative despite positive ASyn. Of course, the 
water entropy always increases upon the binding under iso- 
choric condition. Thus, isochoric condition, which is free 
from the effects of compression or expansion of bulk water, 
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Figure 5. Image illustrating an example imperfection of the electrostatic 
complementarity in the R12:P16 complex. R12 and PI 6 are drawn on the 
upper and lower sides, respectively, using the space-filled model. An an- 
ionic C-terminal of PI 6 (Asnl2) is buried within the contact surface but 
there is no counter cationic group in the vicinity of the buried C-terminal. 
Asnl2 is depicted by the licorice model. The yellow ribbon represents the 
backbone of PI 6. The water molecules hydrating Asnl2 are depicted by 
the ball-and-stick model. This figure is drawn using the VMD. 



is more suited to the physical interpretation of a change in 
a thermodynamic quantity of hydration. 

Significance of electrostatic complementarity 

In the R12:P16 complex, the negatively charged phosphate 
groups of R12 are in contact with the positively charged 
e-amino groups of lysine residues of P16, leading to R12- 
P16 energetic stabilization brought by electrostatic attrac- 
tive interactions (see Figure 2) (5). This is a typical example 
of the electrostatic complementarity (2). Before the R12- 
P16 binding, the negatively and positively charged groups 
interact with water hydrogens with positive partial charges 
and with water oxygens with negative partial charges, re- 
spectively, maintaining R12-water and P16-water electro- 
static attractive interactions. The binding is driven by the 
water-entropy gain, but it is unavoidably accompanied by 
an energy increase arising from the loss of R12-water and 
P16-water electrostatic attractive interactions. About half 
of the energy increase is canceled out by the energy decrease 
due to the structural reorganization of water upon bind- 
ing (26,27), but the net effect acts as destabilization. This 
destabilization and the R12-P16 energetic stabilization are 
compensating. In other words, the latter is required for can- 
celing the former. Similar arguments can be made for gen- 
eral binding processes for two solute molecules, and this is 
why the electrostatic complementarity is often observed in a 
complex. Looking at the structural data for a complex with- 
out accounting for the water roles can mislead to the con- 
clusion that the R12-P16 binding, for instance, is driven by 
the R12-P16 electrostatic attractive interactions described 
above. 

There is a case where the formation of the complete elec- 
trostatic complementarity is not favorable in terms of the 
water entropy. In the R12:P16 complex, for example, an an- 
ionic C-terminal of PI 6 is buried within the contact surface 
but there is no counter cationic group in the vicinity of the 
buried C-terminal, as observed in Figure 5. Such incom- 
plete electrostatic complementarity, which makes a positive 
contribution to the total-energy change upon binding (the 
total energy is defined as the conformational energy plus 
the hydration energy), is accepted when the water-entropy 
gain dominates, or equivalently, when the priority should 
be given to the water-entropy gain. The incompleteness in 
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Figure 6. Water-entropy gain (/cb is the Boltzmann constant) upon the con- 
tact or stacking of two solutes immersed in water, (a) Contact of two spher- 
ical solutes with diameter = 10<is O^s denotes the water diameter), (b) 
Stacking of two disc-shaped solutes with surface diameter D = lOds- (c) 
Stacking of two R12 monomers each of which has disc-like shape with the 
span Z)'~9<is- The solutes in (a) and (b) possess no polyatomic structures 
whereas those in (c) possess them. 



the electrostatic complementarity occurs in such a case, and 
the Apt8-hFcl binding (16,25) mentioned in the Introduc- 
tion section also provides a good example. 

A hydrogen bond is usually expressed as an electrostatic 
attractive interaction between hydrogen and oxygen or ni- 
trogen. The binding is accompanied by the break of solute- 
water hydrogen bonds compensated with the formation of 
solute-solute hydrogen bonds. Thus, the significance of the 
specific hydrogen bonding between the solute molecules 
upon binding can be discussed in a similar manner. 

Crucial importance of geometric characteristics of solute 
molecules 

The entropic EV effect becomes stronger as the molecular 
size of solvent decreases and/or the solvent number den- 
sity increases. Thanks to the hydrogen bonding, water can 
exist in liquid state at ambient temperature and pressure 
despite its exceptionally small molecular size. The effect, 
which is relevant to the translational displacement of sol- 
vent molecules, becomes the largest when the solvent is wa- 
ter. The effect of the translational, configurational entropy 
of water plays imperative roles in sustaining life (65-67). 

When two large spherical solutes with diameter con- 
tact each other as shown in Figure 6a, the water-entropy 
gain is ~6&b and increases in proportion to d^/ds (ds is 
the water diameter, 0.28 nm) (29). At T= 298 K, k^T cor- 
responds to ~— 0.6 kcal/mol in terms of the contribution 
to the free-energy change. On the other hand, the water- 
entropy gain reaching ~60&b occurs upon the stacking of 
two large disc-shaped solutes with the surface diameter D, 



Nucleic Acids Research, 2014, Vol 42, No. 11 6871 



Structural 
CP q change 

P16 ~30/<b 0 




O monomer Q 




O 



Water molecule 

^ o ^ 

Stacking R12:P16 
-75/Cb O com P |ex 




Total water-entropy gain ~105/cb 



Figure 7. Water-entropy gain (k^ is the Boltzmann constant) upon each 
elemental process of the R12-P16 binding. The total water-entropy gain is 
also given. 



and the gain increases in proportion to (D/d$) 2 (see Figure 
6b) (29). It is assumed that the spherical and disc-shaped 
solutes possess no polyatomic structures. Thus, the water- 
entropy gain originating from the entropic EV effect is re- 
markably influenced by the overall shapes and sizes of the 
solutes. It is also largely dependent on the details of the so- 
lute polyatomic structure. An R12 monomer has disc-like 
shape with the upper-surface diameter D'~9d§. The stack- 
ing of two R12 monomers to form a dimer illustrated in 
Figure 6c leads to the water-entropy gain of ~170£b that is 
far larger than the value expected from Figure 6b. This re- 
sult arises from the close packing of the atoms constituting 
the interface regions. The decrease in the total EV brought 
by such packing is much larger than that brought by the 
contact of flat surfaces. 

The water-entropy gain upon each elemental process of 
the R12-P 16 binding is summarized in Figure 7. Each value 
given is the average taken over the five models of the com- 
plex structure. The gain upon the structural change of PI 6 
from the random-coil state to the compact one is ~30&b- 
Even for the structural change of such a small (N v = 12) 
polypeptide, the water-entropy gain becomes this large. The 
gain arising from the binding of PI 6 to one of the surfaces of 
the R12 dimer is ~75/: B - This value is roughly a half of the 
gain in Figure 6c, partly because P16 is smaller than R12. 
The total gain is ~105£ B (105 = 30 + 75). Thus, the binding 
in path II is accompanied by a very large gain of the water 
entropy. 

Significance of stacking of flat moieties 

In the R12:P16 complex, an indole ring of Trp 3 of PI 6 
stacks on a guanidine ring of R12 (see Figure 2). This is 
why the tt-tt stacking interaction was regarded as a signif- 
icant driving force for the binding (5). The it-it stacking 
interaction is often described by the van der Waals attrac- 
tive interaction. When the stacking of flat moieties occurs, 
however, the van der Waals attractive interaction gained is 
accompanied by the loss of the moiety-water van der Waals 
attractive interactions, and these two factors are compensat- 
ing. In other words, the stacking is required for canceling the 
loss. For this reason, in general, the stacking of flat moieties 
is often observed in a complex. The stacking is driven by 
a large gain of the water entropy as understood from Fig- 



ure 6b or c. Looking at the structural data for a complex 
without accounting for the water roles can mislead to the 
conclusion that in the R12-P16 binding, for instance, the 
binding is driven by the it-it stacking interaction described 
above. 

Significance of shape complementarity 

The entropic EV effect is responsible for the shape comple- 
mentarity often observed in a complex (2). For instance, 
it is obvious that the lock-key binding leads to a remark- 
able increase in the water entropy (see Figure 3a) and the 
molecular recognition is attributable to the lock-key shape 
complementarity which provides the largest decrease in the 
total EV. In the induced-fit binding, the lock structure ex- 
hibits a slight structural change so as to acquire the shape 
complementarity. In general binding processes, the shape 
complementarity between portions of the interface regions 
surely plays essential roles, and that at the atomic level also 
makes a remarkably large contribution to the water-entropy 
increase. 

On the structural plasticity 

The discussion on the significance of the electrostatic and 
shape complementarities made in the previous sections can 
also be applicable to the binding described by the lock-key 
or induced-fit model. We now discuss the molecular recog- 
nition accompanied by a global structural change of one of 
the solute molecules. 

Close packing of the constituent atoms of a solute 
molecule leads to a large increase in the water entropy. 
However, such overall close packing cannot necessarily be 
achieved. When it is unachievable, the entropic EV effect 
is not large enough to surpass the energetic dehydration 
and conformational-entropy effects with the result that the 
solute molecule takes a rather extended, flexible structure. 
Even when the close packing can be accomplished, if the so- 
lute molecule is not sufficiently large and the energetic de- 
hydration effect is substantial, it does not take a compact 
structure (68). P16 is a good example. 

A protein is driven to take a structure in which the back- 
bone and side chains are closely packed by the entropic EV 
effect. However, not all of the amino-acid sequences are 
amenable to such overall close packing. Even in cases where 
the overall close packing is not achievable, there are cer- 
tainly the portions that can closely be packed. When pref- 
erential close packing of these portions is made, the other 
portions cannot participate in the packing and therefore 
become disordered and flexible. Nevertheless, the water- 
entropy gain brought by such preferential close packing is 
considerably larger than that resulting from an overall loose 
packing (69). An example case is found for PrP c : its N- 
terminal half including PI and PI 6 is disordered and flexi- 
ble. 

PI 6 and PrP c can be referred to as 'a soft molecule' and 
4 a molecule possessing a soft portion', respectively. By con- 
trast, 'a rigid molecule' possesses a structure in which over- 
all close packing of the constituent atoms is successfully 
achieved, and even a slight change of its structure gives rise 
to an unacceptably large loss of the water entropy. R12 is 
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a good example of rigid molecules. It is possible even for a 
soft molecule or a soft portion of a molecule to construct 
a structure with overall close packing if the construction is 
made in concert with another molecule as a partner. In par- 
ticular, when the partner is rigid, the soft molecule or the 
soft portion may bind to the partner by changing its struc- 
ture in accordance with the partner structure, leading to 
the formation of a stable complex. Of course, there are not 
significantly many partners realizing such formation: only 
those which can realize it is successfully recognized. 

CONCLUSIONS 

We have investigated the binding of R12 and PI 6 as an 
important example of the new type of molecular recog- 
nition accompanied by a global structural change of a 
molecule upon binding to its targets. In this case, PI 6 ex- 
hibits the global change from an unstructured state to a 
compact structure while the structure of R12 remains al- 
most unchanged (5,23). Changes in thermodynamic quan- 
tities upon binding are calculated using the molecular me- 
chanics, hybrid method (30-37) in which the ADIET (38- 
42) is combined with the MA (43,44), and 3D-RISM the- 
ory (45^8). Molecular models are employed for water. The 
hybrid method and the 3D-RISM theory have been quite 
successful in studies on a variety of biological self-assembly 
processes (27-32,49-52) such as protein folding, association 
of protein molecules and receptor-ligand binding. In the 
present study, the hybrid method and the 3D-RISM the- 
ory are employed in calculating the hydration entropy and 
energy, respectively. This employment provides the most re- 
liable results even in a quantitative sense. 

The energy decrease due to the gain of R12-P16 at- 
tractive (i.e. van der Waals and electrostatic) interactions 
is almost canceled out by the energy increase originat- 
ing from the energetic dehydration effect (i.e. by the en- 
ergy increase caused by the loss of R12-water and PI 6- 
water attractive interactions plus the energy lowering aris- 
ing from the structural reorganization of water). The bind- 
ing is driven by the water-entropy gain that predominates 
over the conformational-entropy loss upon the global struc- 
tural change of PI 6. The water-entropy gain originates pri- 
marily from the overlap of EVs of R12 and PI 6 followed 
by the increase in the total volume available to the trans- 
lational displacement of water molecules in the system by 
the overlapped volume. Here, the EV is the volume of the 
space which the centers of water molecules cannot enter. 
The binding gives rise to an energy decrease arising from 
van der Waals attractive interactions between R12 and PI 6, 
but it is accompanied by an energy increase related to the 
loss of R12-water and P16-water van der Waals attractive 
interactions: the former is slightly larger and the net change 
(net change 1) is negative. The binding brings an energy de- 
crease arising from electrostatic attractive interactions be- 
tween R12 and PI 6, but it is accompanied by an energy in- 
crease related to the loss of R12-water and P16-water elec- 
trostatic attractive interactions; the latter is slightly larger 
and the net change (net change 2) is positive. The loss of 
R12-water and P16-water attractive interactions is quite 
large. However, about half of it is canceled out by the water 
reorganization energy (26,27), energy lowering due to the 



structural reorganization of water upon binding. The resul- 
tant loss becomes comparable with the gain of R12-P16 at- 
tractive interactions. Net changes 1 and 2 are also compen- 
sating. 

The picture of the binding can be summarized as follows. 
The binding is driven by the water-entropy gain which over- 
comes the dehydrations of R12 and PI 6 with the aid of a 
gain of R12-P16 van der Waals and electrostatic attrac- 
tive interactions. A loss of the conformational entropy of 
PI 6, which opposes the binding, is surpassed by the water- 
entropy gain. It is very important to assure the gain of 
R12-P16 attractive interactions during the binding process 
though the gain itself is not a driving force. We believe that 
this picture is applicable to general binding processes. This 
is why stacking of flat moieties, specific hydrogen bond- 
ing and electrostatic complementarity (i.e. contact of oppo- 
sitely charged groups) are frequently observed in the com- 
plexes (2). Stacking of flat moieties leads to a gain of inter- 
molecular van der Waals attractive interactions, and spe- 
cific hydrogen bonding and electrostatic complementarity 
lead to gains of intermolecular electrostatic attractive in- 
teractions. However, there is a case where it is not possible 
to accomplish both the water-entropy gain and the electro- 
static complementarity. When the water-entropy gain dom- 
inates, or equivalently, when the priority should be given 
to the water-entropy gain, the electrostatic complementarity 
becomes incomplete. Such incompleteness has been found 
in the Apt8-hFcl binding (16,25). 

The geometric characteristics (overall shapes, sizes and 
details of the polyatomic structures) of the solute molecules 
play crucially important roles in discussing the water- 
entropy gain. For instance, stacking of disc-shaped solutes 
brings a large decrease in the total EV followed by a cor- 
respondingly large gain of the water entropy. The gain be- 
comes larger as the solute size increases (29). These effects 
are enlarged when the interface atoms are closely packed. 
The reason why stacking of flat moieties is frequently ob- 
served in the complexes (2) can also be understood from 
this viewpoint. Further, it is obvious that the water-entropy 
gain becomes quite large when the shape complementarity 
occurs within the interface region. The most striking exam- 
ple is found in the lock-key binding (see Figure 3a). The 
induced-fit binding, in which the lock structure exhibits a 
slight structural change so as to acquire the shape comple- 
mentarity, is also driven by a large gain of the water entropy. 

In what follows, we describe our proposition concerning 
the mechanism of molecular recognition accompanied by a 
global structural change of a molecule upon binding to its 
targets. When a solute molecule is not sufficiently large, the 
water-entropy gain is often incapable of surpassing the ener- 
getic dehydration and conformational-entropy effects with 
the result that the solute molecule takes a rather extended, 
flexible structure (68). Even for a large solute molecule like 
a protein, when its overall close packing is not achievable, 
only the portions that can closely be packed are preferen- 
tially packed: the other portions cannot participate in the 
close packing and therefore become disordered and flexi- 
ble (69). (The water-entropy gain brought by such prefer- 
ential close packing is considerably larger than that result- 
ing from an overall loose packing.) These solute molecules 
can be referred to as 'a soft molecule' and 'a molecule pos- 



Nucleic Acids Research, 2014, Vol. 42, No. 11 6873 



sessing a soft portion', respectively. By contrast, a solute 
molecule possessing a structure in which overall close pack- 
ing of the constituent atoms is successfully achieved is 'rigid' 
because even a slight change of its structure gives rise to 
an unacceptably significant loss of the water entropy. It is 
possible even for a soft molecule or a soft portion of a 
molecule to construct a structure with overall close packing 
if the construction is made in concert with another molecule 
as a partner. In particular, when the partner is rigid, the 
soft molecule or the soft portion may bind to the partner 
by changing its structure in accordance with the partner 
structure, leading to the formation of a stable complex. Of 
course, there are not significantly many partners realizing 
such formation: only those which can realize it is success- 
fully recognized. 

Last, it is worthwhile to recapitulate the mechanism of 
the R12-P16 binding. For the binding, PI 6 changes its 
structure so that the R12-P16 interface atoms can closely 
be packed. At the same time, however, a decrease in R12- 
P16 intermolecular energy as well as that in PI 6 intramolec- 
ular energy (i.e. a gain of van der Waals and electrostatic at- 
tractive interactions between R12 and PI 6 and that within 
PI 6) is required to compensate for the energetic dehydra- 
tion effect. As for the R12-P16 intermolecular energy, the 
formations of stacking structure and of electrostatic com- 
plementarity lead to decreases in van der Waals and electro- 
static interaction energies, respectively. Since the stacking 
of flat moieties and the contact of unlike-charged groups 
bring decreases in the EVs, the formations are also favor- 
able in terms of the water-entropy gain. The water-entropy 
gain brought by close R12-P16 interface packing, with the 
help of the decrease in the sum of PI 6 intramolecular and 
R12-P 16 intermolecular energies, is large enough to surpass 
the energetic dehydration effect and the conformational- 
entropy loss of P16: the R12-P16 binding is achieved. If it 
was not large enough, the binding and the molecular recog- 
nition would not be realized. For the energetic component, 
the decrease in the sum of PI 6 intramolecular and R12- 
P16 energies, which is in favor of the binding, is consider- 
ably smaller than the opposing energetic dehydration effect. 
For the entropic component, by contrast, the water-entropy 
gain promoting the binding is far larger than the opposing 
conformational-entropy loss of P16. 

To demonstrate our proposition mentioned in the last 
paragraph but one, we intend to investigate the binding of 
another RNA aptamer to its targets upon which the RNA 
aptamer exhibits a global structural change. The binding of 
an intrinsically disordered protein to its targets is also an in- 
teresting subject to be tackled. The ultimate goal is to eluci- 
date all types of molecular recognitions including the 'lock 
and key' and 'induced-fit' binding models within the same 
theoretical framework in a unified manner. 
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